Speech Data Compression for Embedded Systems
Abstract
This paper addresses speech data compression for low-cost embedded systems, such as voice-related toys or devices with interactive sound responses. We use a PC to generate and compress 8-bit speech data of various kinds, such as human speech, symphonic music, and animal songs; the compressed data are then transferred to a masked ROM. An Intel 8051 embedded chip expands the compressed speech data stored in the masked ROM, whose size ranges from 1 to 8 megabytes. To reduce production cost, the system has no RAM (read/write memory). The ROM also stores the program that expands the compressed data. Since compression is done off-line, we use simple vector quantization techniques to perform a first-stage lossy compression of the data; the lossy-compressed data are then passed through a second-stage lossless compression. Because the expansion stage lacks RAM, an LZ compression technique cannot be applied. Instead, we use a run-length Huffman encoding scheme coupled with a unary code to perform the lossless compression. Also, because the processor in the expansion stage is slow, a length-limited Huffman code must be used. Instead of using the common package-merge [1] algorithm to obtain the length-limited code, we develop a technique that uses splaying to achieve this goal.
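The abstract does not spell out how the run-length, Huffman, and unary codes are combined, so the following is only a minimal sketch under stated assumptions: each run is written as a prefix codeword for the sample value followed by the run length in unary. The names `TOY_CODE`, `unary_encode`, `rle_encode`, and `rle_decode` are hypothetical, and `TOY_CODE` is a hand-picked prefix code with a maximum length of 3 bits, not one produced by package-merge or by the splaying technique. The point it illustrates is that the decoder needs only a partial codeword and a counter, and never refers back to earlier output as an LZ decoder would.

```python
from itertools import groupby

# Hypothetical toy prefix code over four quantizer indices (max length 3 bits).
TOY_CODE = {0: "0", 1: "10", 2: "110", 3: "111"}

def unary_encode(n):
    """Unary code for n >= 1: (n - 1) one-bits followed by a single zero-bit."""
    if n < 1:
        raise ValueError("run length must be at least 1")
    return "1" * (n - 1) + "0"

def rle_encode(samples, code):
    """Assumed layout: prefix codeword for the value, then its run length in unary."""
    pieces = []
    for value, group in groupby(samples):
        run = sum(1 for _ in group)
        pieces.append(code[value] + unary_encode(run))
    return "".join(pieces)

def rle_decode(bits, code):
    """Streaming decoder: holds only the current partial codeword and a run counter."""
    inverse = {word: value for value, word in code.items()}
    pos = 0
    while pos < len(bits):
        # Read bits until they form a complete codeword of the prefix code.
        word = ""
        while word not in inverse:
            word += bits[pos]
            pos += 1
        value = inverse[word]
        # Read the unary run length: count one-bits up to the terminating zero.
        run = 1
        while bits[pos] == "1":
            run += 1
            pos += 1
        pos += 1  # skip the terminating zero-bit
        for _ in range(run):
            yield value

samples = [0, 0, 0, 1, 2, 2, 3]
bits = rle_encode(samples, TOY_CODE)
assert list(rle_decode(bits, TOY_CODE)) == samples
```

Capping the codeword length, as the toy code does, bounds the number of bits the inner loop reads per symbol, which is presumably why a length-limited code suits a slow processor.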
Similar resources
The Cerevoice Blizzard Entry 2007: Are Small Database Errors Worse than Compression Artifacts?
In commercial systems the memory footprint of unit selection systems is often a key issue. This is especially true for PDAs and other embedded devices. In this year's Blizzard entry, CereProc® set itself the criterion that the full database system entered would have a smaller memory footprint than either of the two smaller database entries. This was accomplished by applying Speex speech compres...
Speaker-specific retraining for enhanced compression of unit selection text-to-speech databases
Unit selection based text-to-speech systems can generally obtain high speech quality provided that the database is large enough. In embedded applications, the related memory requirements may be excessive and often the database needs to be both pruned and compressed to fit it into the available memory space. In this paper, we study the topic of database compression. In particular, the focus is o...
Compression of speech database by feature separation and pattern clustering using STRAIGHT
This paper presents an alternative solution for speech database compression aimed at the embedded application of concatenative synthesis systems. The waveform of a speech segment is first decomposed into a prosodic pattern and a spectral pattern by STRAIGHT, a powerful speech analysis-synthesis algorithm. Then all the prosodic and spectral patterns are clustered respectively to remove the re...
Speech Compression using Analysis by Synthesis
Linear prediction plays a fundamental role in all aspects of speech. Its use seems natural and obvious since for a speech signal the value of its current sample can be well modeled as a linear combination of its past values. Calculation for predictor coefficients with the help of automatic code generation gives the solution for early and efficient computing. Automatic code generation is a fast ...
Types-Based Data Embedding for ITU G
In this paper, we expand on a new technique for embedding digital information into G.711 encoded speech signals. Using the method of types as a tool to analyze the statistical nature of a digital speech signal, we demonstrate that our embedding scheme is capable of embedding up to 1.6 Kbps of additional information at an average embedded error rate of 10. We are able to achieve these embedded r...